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Abstract 

We report the identification, cloning and tissue distributions of ten novel human genes encoding G protein-coupled receptors (GPCRs) 
GPR78, GPR80, GPR81, GPR82, GPR93, GPR94, GPR95, GPR10J, GPR102, GPR103 and a pseudogene, tyGPR79. Each novel orphan 
GPCR (oGPCR) gene was discovered using customized searches of the GenBank high-throughput genomic sequences database with 
previously known GPCR-encoding sequences. The expressed genes can now be used in assays to determine endogenous and pharmacolo- 
gical ligands. GPR78 shared highest identity with the oGPCR gene GPR26 (56% identity in the transmembrane (TM) regions). ij/GPR79 
shared highest sequence identity with the P2Y 2 gene and contained a frame-shift truncating the encoded receptor in TM5, demonstrating a 
pseudogene. GPR80 shared highest identity with the P2Y t gene (45% in the TM regions), while GPR8I, GPR82 and GPR93 shared TM 
identities with the oGPCR genes HM74 (70%), GPR17 (30%) and P2Y 5 (40%), respectively. Two other novel GPCR genes, GPR94 and 
GPR95, encoded a subfamily with the genes encoding the UDP-glucose and P2Y 12 receptors (sharing >50% identities in the TM regions). 
GPR101 demonstrated only distant identities with other GPCR genes and GPR102 shared identities with GPR57, GPR58 and PNR (35-42% 
in the TM regions). GPRI03 shared identities with the neuropeptide FF 2, neuropeptide Y2 and galanin GalRl receptors (34-38% in the TM 
regions). Northern analyses revealed GPR78 mRNA expression in the pituitary and placenta and GPR8J expression in the pituitary. A search 
of the GenBank databases with the GPR82 sequence retrieved an identical sequence in an expressed sequence tag (EST) partially encoding 
GPR82 from human colonic tissue. The GPR93 sequence retrieved an identical, human EST sequence from human primary tonsil B-cells and 
an EST partially encoding mouse GPR93 from smairinlestinallissue~.77^ 

thalamus of brain while GPR95 was expressed in the human prostate and rat stomach and fetal tissues. GPR101 revealed mRNA transcripts 
in caudate putamen and hypothalamus. GPR103 mRNA signals were detected in the cortex, pituitary, thalamus, hypothalamus, basal 
forebrain, midbrain and pons. © 2001 Elsevier Science B.V. All rights reserved. 
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1. Introduction 

As is frequently stated, GPCRs are the largest family of 
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cell surface receptors and are responsible for the signal 
transduction for a diverse variety of ligands including 
nucleotides, biogenic amines, peptides and other small 
molecules (Marchese et al., 1999). GPCRs share a common 
heptahelical topography and these regions are embedded in 
the membrane. These seven transmembrane (TM) regions 
share the most significant levels of receptor identity. As a 
consequence, the majority of DNA sequences encoding 
GPCRs were found using methods dependent on sequence 
homology, mainly PCR or electronic sequence database 
screening (Marchese et al., 1998). GPCRs activated by simi- 
lar ligands share the greatest identities with each other. 
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However, newly discovered GPCRs frequently have only 
distant identities with known GPCRs, and these oGPCRs 
are difficult to characterize given the diversity in structure 
of the ligands and effector systems. This problem is 
compounded as we now realize that many endogenous 
ligands remain to be discovered. Increasingly, oGPCR char- 
acterization has utilized methods of 'reverse pharmacol- 
ogy', using the receptor as bait to retrieve ligands from 
tissue extracts. These efforts have identified endogenous 
ligands such as apelin, ghrelin, melanin-concentrating 
hormone, neuromedin U, the orexins, urotensin-H, and 
UDP-glucose (recently reviewed in Lee et al., 2001a; Civelli 
et al., 2001; Howard et al., 2001). 

Approximately 250 mammalian genes encoding family 
A (or rhodopsin-like) GPCRs have been cloned (Lee et al., 
2001a). As yet, the total number of cloned GPCRs reported 
in the literature including the secretin and rnetabotropic 
glutamate-like families of GPCRs falls short of the 
projected 616 GPCR-encoding sequences observed from 
the complete human genome sequence (Venter et al., 
2001). Despite the human genome sequencing efforts, 
much work is still required to identify and clone the 
open reading frames (ORF) encoding the full complement 
of GPCR genes. Inserted into suitable expression vectors, 
these DNA sequences can be used to express the receptor 
in assays which will assist in ligand identification. For 
these reasons, we are continuing in our efforts to identify, 
catalog, compare and map the expression of GPCRs. We 
have recently reported the identification of the H4 hista- 
mine receptor (Nguyen et al., 2001), the cysteinyl leuko- 
triene 2 receptor (Heise et al., 2000) and the oGPCR- 
encoding genes GPR26, GPR57, GPR58 (Lee et al., 
2000), GPR61, GPR62, GPR63 and GPR77 (Lee et al., 
2001b). We now report the cloning of ten additional 
oGPCR-encoding genes named GPR78, GPR80, GPR81, 
GPR82, GPR93, GPR94, G PR95 , GPRJ0J, GPRWl ^d 
GPR103 as well as a pseudogene ipGPR79. GPR78 and 
GPR81 most closely resemble the oGPCR genes GPR26 
and HM74, respectively. GPR80, GPR93 and tf/GPR79 
shared highest identities with members of the purinoceptor 
family, while GPR82 encoded an oGPCR distantly related 
to the purinoceptor-like oGPCR genes GPR17 and GPR34. 
In addition, two novel genes GPR93 and GPR94 share 
significant identities with each other and with recently 
identified genes encoding the UDP-glucose (Chambers et 
ah, 2000) and platelet ADP (P2Y 12 ) receptors (Hollopeter 
et al., 2001 ; Zhang et al., 2001), which together comprise a 
clustered family of genes on chromosome 3. GPR101 
shared distant identity with amine-binding GPCR genes, 
GPR102 shared identity with the PNRIGPR57IGPR58 
amine receptor-like subfamily of GPCR genes and 
GPR103 shared identities with peptide-binding receptors, 
including the neuropeptide FF 2, neuropeptide Y2 and 
galanin GalRl receptors. We have also detected mRNA 
transcripts in tissues for GPR78, GPR8J, GPR94, 
GPR95, GPRJ01 and GPR103. 



2. Materials and methods 

2.7. Database searching 

We queried the expressed sequence tag (EST) and high- 
throughput genomic sequences (HTGS) databases main- 
tained by the National Center for Biotechnology Informa- 
tion with the amino acid (aa) sequences of various GPCRs 
using the TBLAST algorithm (Altschul et al., 1997). 
Retrieved sequences having statistically significant scores 
were further examined. The conceptualized protein 
sequences encoded by these sequences were used to query 
the non-redundant (nr) database to determine whether these 
sequences encoded previously known GPCRs. 

2.2. GPCR gene and cDNA cloning 

GPR78 was originally obtained in two fragments in an 
HTGS sequence from human chromosome 4 (GenBank 
Accession number: AC007104) which encoded the start 
methionine to the third intracellular loop (IC3) and from 
the carboxyl region of TM6 to the stop codon. Based on 
this sequence, two DNA fragments encoding from TM1 to 
TM4 and from TM6 to the stop codon were amplified from 
human genomic DNA. PCR products were extracted with 
phenol and chloroform, precipitated with ethanol and elec- 
trophoresed on a low-melting point agarose gel. Products in 
the expected size range were ligated into the EcoRV site of 
pBluescript SK(-) (Stratagene, La Jolla, CA) or pcDNA 3 
(Invitrogen, Carlsbad, CA) and sequenced. Both fragments 
were observed to be identical to the HTGS sequence, and 
were used as probes to screen a human genomic library as 
previously described (Marchese et al., 1994). Library screen- 
ing retrieved two phage DNA which encoded from the start 
methionine to IC3 and from the carboxyl region of TM6 to 
_the_stop_codon._ Primers_designed^upon_TM5. _(Rl:„5.!-_ 
GCTTCGTGCTGCCGCTG-3 ') and TM7 (P2: 5'-CGGAG- 
C AG AG AGT ACGTG-3 ') were used to PCR amplify Mara- 
thon ready human fetal cDNA (Clontech, Palo Alto, CA) to 
retrieve a fragment sharing 100% identity in regions of over- 
lap with the HTGS and human genomic phage DNA 
sequences. To obtain the complete intronless ORF of this 
gene, three overlapping segments encoding GPR78 were 
obtained by PCR. Fragment 1 (encoding from the start 
methionine to TM5) was amplified from human genomic 
DNA using two primers (P3: 5'-GCGCCATGGGCCCCG- 
GCGAGG-3', P4: 5 '-GGTG ACGGTGTCC ATGCGC-3 '). 
Fragment 2 was amplified using primers PI and P2 from 
human fetal cDNA (described above). Fragment 3 (encoding 
from the third extracellular loop and extending 3' of the stop 
codon) was amplified from human genomic DNA using two 
primers (P5: 5 '-CTGGCGG AGCTCGTGCCC-3 ', P6: 5'- 
GGCCAGTGCCCTTTCCAC-3'). These DNA fragments 
were joined by two further rounds of PCR. Round one 
consisted of fragments 1 and 2 together undergoing 30 cycles 
of 94°C for 30 s, 50°C for 30 s, and 72°C for 1 min using 
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primers P2 and P3. A second round of PCR amplified an 
aliquot of the first round with fragment 3 using primers P3 
and P6 at similar cycling conditions. The PCR products were 
subcloned into the EcoRV site of pcDNA 3 (Invitrogen) and 
sequenced. 

GPR103 was originally obtained in two overlapping frag- 
ments from the EST database (encoding from TM4 to TM7, 
GenBank Accession number: AI307658) and the HTGS 
database (encoding from TM6 to the stop codon, GenBank 
Accession number: AC005961). Analysis of the EST clone 
obtained from the I.M.A.G.E. Consortium revealed that this 
fragment encoded the receptor from TM2 to TM7. Based on 
these sequences, two DNA fragments encoding from TM3 to 
TM7 and from TM7 to the stop codon were obtained by PCR 
from human hypothalamus cDNA (Clontech) and human 
genomic DNA, respectively. PCR products were purified 
and ligated into pBluescript as described above. Sequence 
analysis revealed both DNA fragments to be identical with 
their respective database sequences. The fragment encoding 
TM3 to TM7 was used as a probe to screen a human hypotha- 
lamus cDNA library (Clontech), which retrieved a phage 
encoding GPR103 from the start methionine to TM3 sharing 
100% identity in the overlapping region with the EST derived 
sequence: To obtain a complete intronless ORF of this gene, 
the three overlapping fragments were joined by PCR as 
described above. The PCR products were subcloned into 
the EcoRV site of pcDNA 3 and sequenced. 

To obtain DNA encoding other GPCRs, human genomic 
DNA was amplified by PCR using the following oligonu- 
cleotides: $GPR79 (5 '-TGGGGC AG AGGCTG ATGCC A- 
TGC-3 ', 5 '-AGCTGG ATGCTC ACCC AACTTGTTC-3 '), 
GPRSO (5 '-G ATTC AT ATTGCC AA ACTG AAC-3 ' , 5'-C- 
ATCCTGAACATCTTAGGATG-3'), GPR81 (5'-CTAA- 
CGCTCAGATAAGCATCTGTG-3', 5'-GTCACCACTC- 
TATCTTCCTCAGTG-3'), GPR82 (5 AATTCT ATTCT- 
AGCTCCTGTG-3^, 5 ^CT AAT AAAGTC AC ATGAATG-_ 
C-3'), GPR93 (5 '-TTTGGCACGATGTTAGCC-3', 5'-G- 
TTC AG AGGGCGG AATCC-3 '), GPR94 (5'-AAGCAAT- 
G AAC ACC AC AGTG ATGC-3 5 '- ATTATCT ACGG AA- 
GTCTCATC-3'), GPR95 (5'-AGTTGGGTCTGTAAGG- 
G AACC-3 ', 5 '-TTT ATTT AC ACTTTGT ACAT ATCG-3 '), 
GPR101 (5 '-CTGGCTGTTGCC ATG ACGTCC-3 5'-GC- 
CTTAGAACTAACTTCAAGG-3'), GPR102 (5'-CAAA- 
CAAC AAACAGCAG AACC-3 ', 5 '-CTTAGTGCTT AAA- 
CTTATTC-3') and P2Y t2 (5 '- AAATA ACCATCCTCTC- 
TTTTGTTC-3', 5 '-CG AGTTCTG A AC AC AAAG AG AT- 
TG-3'). PCR conditions were as follows: denaturation at 
94°C for 30 s, annealing at 50°C for 30 s, and extension at 
72°C for 1 min. PCR products were purified, ligated into 
pcDNA3 or pBluescript vectors and sequenced as described 
above. 

2.5. Northern blot analyses 

Human and rat mRNA were extracted from various 
tissues as described previously (Marchese et al., 1994). 



Briefly, total RNA was extracted by the method of Chomc- 
zynski and Sacchi (1987), and poly (A) + RNA was isolated 
using oligo(dT) cellulose spin columns (Pharmacia, 
Uppsala, Sweden). RNA was denatured and separated by 
electrophoresis on a 1% formaldehyde agarose gel, trans- 
ferred onto nylon membrane and immobilized by UV irra- 
diation. The blots were hybridized with human GPCR- 
encoding 32 P-labeled DNA fragments, washed with 2X 
SSPE (3 M NaCl, 0.2 M NaH 2 P0 4 , 0.02 M EDTA) and 
0.1% SDS at 50°C for 20 min, washed again with 0.1 X 
SSPE and 0.1% SDS at 50°C for 2 h and exposed to X- 
ray film at — 70°C in the presence of an intensifying screen. 
In addition, two human Multiple Tissue Northern (MTN™) 
blots (Clontech) were used in GPR78 (Human MTN Blot) 
and GPR95 (Human MTN Blot IV) expression analyses 
according to the manufacturer's instructions. 



3. Results 

3 J. Cloning of GPCR-encoding genes 

A search of the HTGS database with the GPR26 sequence 
retrieved a novel human genomic sequence encoding a 
GPCR localized to chromosome 4 (GenBank Accession 
number: AC007104). The sequence was encoded on two 
fragments, from the start methionine to the middle region 
of IC3 and from the carboxyl region of TM6 to the stop 
codon. Primers were designed to PCR amplify human geno- 
mic DNA which retrieved fragments encoding TM1 to TM4 
and from TM6 to the stop codon. These fragments were 
used to screen a human genomic library and two phage 
DNA fragments were retrieved. We also used TM5- and 
TM7-specific primers to amplify a DNA fragment from 
human fetal cDNA. The cDNA PCR product revealed 
.100% .identity _in . ^regions .overlapping, the _ fragments^ 
previously obtained from genomic DNA, confirming these 
fragments as segments of the same gene. To obtain the full- 
length ORF, the three overlapping fragments were joined by 
PCR, and this clone was named GPR78 (Fig. 1). GPR78 
encoded a 363 aa protein which shared extensive identities 
in the TM regions with GPR26 (56%) (Table 1). 

A customized search of the HTGS database retrieved a 
human genomic sequence (GenBank Accession number: 
AC021773) apparently encoding a novel GPCR. However, 
the sequence (if/GPR79) contained a frame-shift in the ORF. 
Primers were designed and used to PCR amplify this region 
of DNA sequence, and the product sequenced to verify the 
frame-shift in the TM5-encoding region confirming a pseu- 
dogene {\ffGPR79). if/GPR79 was used to search the HTGS 
database, which retrieved a related GPCR-encoding human 
genomic sequence localized to chromosome 13 (GenBank 
Accession number: AC026756). The PCR product contain- 
ing this gene revealed an ORF of 336 aa in length, which we 
named GPR80. A search of the nr database with the 
projected if/GPR79 aa sequence revealed significant identi- 
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Fig. 1 . oGPCR sequence alignments. Amino acid comparisons between novel GPCRs. The TM domain regions are indicated, and amino acids are numbered on 
the right. Black boxes with white lettering indicate conserved residues for purinergic binding (Erb et al., 1995; Jiang et ah, 1997; Hoffmann et al., 1999). The 
presence of introns interrupting the GPR78m<\ GPR95 gene sequences is indicated by 'V and *¥ \ respectively. The presence of a frame-shift interrupting the 
ipGPR79 sequence is indicated by Residues shared between at least three aligned GPCR sequences are shaded. 



ties in the TM regions with the purinoceptors P2Y 2 (51%), 
P2Y 4 (50%) and P2Y 6 (43%). A similar search with the 
GPR80 sequence revealed greatest identities in the TM 
regions with the purinoceptors P2Yj (45%) and P2Y 4 
(39%) and the cysteinyl leukotriene receptor CysLT2 
(39%). 



From the HTGS database we retrieved a GPCR-encoding 
sequence contained on a bacterial artificial chromosome 
(BAG) clone localized to chromosome 12q (GenBank 
Accession number: AC026331). The PCR amplified product 
(GPR8J) encoded a 347 aa protein with identities in the TM 
regions with HM74 (70%), GPR31 (43%) and the purino- 
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Table I 

oGPCR sequence identities 8 



Receptor 


% Identity 


Accession number 


Receptor 


% Identity 


Accession number 


GPR78 


GPR26, 49 (56) 
SSTR4,21 (29) 
5HT 6 , 20 (25) 


AF411107 


GPR95 


UDP-glucose, 42 (62) 
P2Y.2, 39 (54) 
GPR94, 37 (51) 


AF11114 


GPR80 


P2Y„31 (45) 
P2Y 4 , 29 (39) 
CysLT2, 31 (39) 


AF411109 


GPR101 


RE2, <20 (31) 
5HT 1A , <20 (29) 
a IA , <20 (29) 


AF411I15 


GPR81 


HM74, 50 (70) 
GPR3 1 , 30 (43) 
P2Y,, 23 (37) 


AF411110 


GPR 102 


PNR, 40 (42) 
GPR57, 36 (35) 
GPR58, 33 (35) 


AF411116 


(jrKoZ 


UrK.34, Zj 
GPR17, 24 (30) 
SSTR2, 23 (30) 


Ar41 1111 


OrKIUJ 


INrrrZ, j 1 \jo) 
NPY2, 29 (37) 
GalRl,30 (35) 


A R/l 11117 
Ar41 111/ 


GPR93 


P2Y 5 , 31 (40) 
GPR23, 28 (38) 
GPR 1.7, 25 (36) 


AF411112 


ij/GPR79 


P2Y 2 , 37(51) 
P2Y 4 , 37 (50) 
P2Y 6 , 33 (43) 


AF411108 


GPR94 


P2Yi 2 , 46 (57) 
UDP-glucose, 41 (52) 
GPR95, 37 (51) 


AF4i i i 13 









a % Identity represents the shared sequence identities of each receptor with three of the closest GPCR sequences (TM identities are in parentheses). Each 
novel oGPCR sequence can be accessed through GenBank (http://www2.ncbi.nlm.nih.gov/genbank/query_form.html) with the listed Accession numbers. 



ceptor P2Y1 (37%). The HM74 sequence was retrieved in 
the same BAC clone as GPR81, indicating a clustering of 
these two genes. 

GPR82 was retrieved using GPR34 (from a search of the 
HTGS database) on a human BAC clone localized to chro- 
mosome 1 (GenBank Accession number: AL161458). The 
GPR82 PCR product encoded a 336 aa protein sharing iden- 
tities in the TM regions with the oGPCRs GPR34 (31%) and 
GPR 17 (30%) and the somatostatin receptor SSTR2 (30%). 

A search of the HTGS database using the cysteinyl leuko- 
triene 2 receptor sequence retrieved a sequence (GPR93) 
encoding a GPCR on a human BAC clone localized to 

chrom osome 1 2 (GenBank Accession number:_ 

AC006087). The PCR product obtained encoded a 372 aa 
protein which shared identities in the TM regions with the 
oGPCRs P2Y 5 (40%), GPR23 (38%) and GPR 17 (36%). 

A search of the HTGS database retrieved a cluster of three 
genes encoding GPCRs within the same contig localized to 
chromosome 3 (GenBank Accession number: AC024886). 
One of these genes was recently reported to encode the 
platelet ADP receptor, P2Y, 2 (Zhang et al„ 2001 ; Hollopeter 
et al., 2001). The P2Y 12 receptor gene shared homology 
with the other two genes, which we named GPR94 and 
GPR95. A PCR product (GPR94) encoded a 333 aa protein 
which shared identities in the TM regions with the P2Y U 
receptor (57%), the UDP-glucose receptor (52%) and the 
receptor encoded by GPR95 (5 1 %). GPR95 encoded a trun- 
cated GPCR, from TM1 to the stop codon, with an intron in 
the ORF evident by the lack of start methionine and the 
presence of an upstream in-frame stop codon. A search of 
the EST database retrieved one EST sequence from human 
testis mRNA encoding GPR95 which also lacked a start 
methionine codon (GenBank Accession number: 



AA758208). This EST was acquired from the I.M.A.G.E. 
Consortium, sequenced, and confirmed to share 100% iden- 
tity with the genomic sequence. A more recent database 
search retrieved an EST sequence from a human bladder 
cell line encoding GPR95 with an alternative 5' coding 
region (GenBank Accession number: BF028445), confirm- 
ing a complete GPR95 ORF and the presence of an intron 
between the start methionine and TM1 -encoding regions. 
GPR95 encoded a 358 aa protein which shared highest iden- 
tities in the TM regions with the UDP-glucose receptor 
(62%), the P2Y 12 receptor (54%) and the receptor encoded 
by GPR94{5\%). 

_We used the histamine-HUand-H4 receptor-sequences to 

retrieve two GPCR-encoding sequences, GPR101 and 
GPR102, respectively, from the HTGS database localized 
to chromosome X (GenBank Accession number: 
AL390879) and chromosome 6 (GenBank Accession 
number: AL357505), respectively. A PCR amplified 
product (GPR10I) encoded a 508 aa protein with identities 
in the TM regions with the oGPCR RE2 (31%), the seroto- 
nin 5HT IA receptor (29%) and the a 1A adrenergic receptor 
(29%). The GPR102 PCR product encoded a 342 aa protein 
sharing identities in the TM regions with the oGPCRs PNR 
(42%), GPR57 (35%) and GPR58 (35%). 

A search of the EST and HTGS databases with the rat 
leukotriene LT2 receptor sequence retrieved human DNA 
sequences encoding a novel GPCR, GPR 103. The novel 
receptor sequence was encoded on two overlapping frag- 
ments from a human kidney EST (encoding from TM2 to 
TM7) and from a human HTGS sequence (encoding from 
TM6 to the stop codon). Primers were designed to PCR 
amplify human hypothalamus cDNA and genomic DNA 
which retrieved two fragments encoding TM3 to TM7 and 
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from TM6 to the stop codon, respectively. The TM3 to 
TM7-encoding fragment was used to screen a human 
hypothalamus cDNA library, which retrieved a phage 
DNA fragment encoding GPR103 from the start methionine 
to TM3. The three DNA fragments revealed 100% identity 
in regions of overlap, confirming these fragments as 
segments of the same gene. To obtain the full-length 
ORF, the three overlapping fragments were joined by 
PCR, and this clone was named GPR103. GPR103 encoded 
a 455 aa protein which shared identities in the TM regions 
with various peptide receptors, including the neuropeptide 
FF 2 (38%), neuropeptide Y2 (37%) and galanin GalRl 
(35%) receptors. 

3.2. Expression analyses 

GPR78 rnRNA transcripts were detected in human pitui- 
tary (1.1 kb) and placenta (two signals of 4.2 and 1.1 kb in 
size) (Fig. 2A,B). However, no GPR78 transcripts were 
observed in human brain or specific CNS regions such as 
the frontal cortex, putamen, thalamus, hypothalamus, amyg- 
dala, hippocampus, pons, medulla and midbrain. In addi- 
tion, human GPR78 transcripts were also absent from 
skeletal muscle, lung, heart, liver, pancreas and kidney. 

To determine the expression distribution of these novel 
oGPCRs, we performed Northern blots in various human 
and rat tissues. tyGPR79 mRNA transcripts were not 
detected in human brain tissues, including frontal cortex, 



basal forebrain, pituitary, caudate nucleus, nucleus accum- 
bens or hippocampus. In addition, Northern analyses did not 
reveal ij/GPR79 in rat brain, fetus, liver, spleen or adrenal 
gland tissue. Similarly, GPR80 mRNA transcripts were not 
detected in human brain tissues including the frontal cortex, 
caudate putamen, thalamus, hypothalamus, hippocampus or 
pons tissue. 

An mRNA transcript was detected for GPR8I in human 
pituitary tissue, with an absence of signal in frontal, 
temporal and occipital lobes of the cortex, basal forebrain, 
caudate nucleus, nucleus accumbens, and hippocampus 
(Fig. 2C). Analyses of GPR82 mRNA expression revealed 
no transcripts in human tissues, including regions of the 
CNS such as the frontal cortex, caudate putamen, thalamus, 
hypothalamus, hippocampus, pons and liver tissue. A search 
of the GenBank database retrieved an EST from cells 
derived from human colonic tissue encoding GPR82 
(GenBank Accession number: BF335802). The GPR93 
probe failed to detect mRNA transcripts in human frontal 
cortex, basal forebrain, caudate putamen, thalamus, or 
hippocampus. A search of the GenBank database revealed 
an EST encoding GPR93 from human primary tonsil B-cells 
(GenBank Accession numbers: BF975186, BF663176 and 
BF129117) and an EST encoding a mouse GPR93 ortholo- 
gue expressed in the small intestine (GenBank Accession 
numbers: AV064817 and AV064680). 

GPR94 mRNA transcripts of 3.2 kb were detected in 
human CNS tissues including the frontal cortex, caudate 
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Fig. 2. oGPCR tissue distribution analyses. Northern blot of poly(A) + RNA (10 u-g/lane), except for (A) and (E) which contained 2 p,g/lane. (A) Human 
MTN™ and (B) human tissue distribution of GPR78. (C) GPR81 and (D) GPR94 human CNS tissue distribution. (E) Human MTN™ and (F) rat tissue 
distribution of GPR95, hybridized with a radiolabeled fragment encoding human GPR95. (G) P2Y, 2 and (H) GPRI01 human CNS tissue distribution. (I) 
Human tissue distribution of GPR103. 
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putamen and thalamus (Fig. 2D). GPR94 was not detected 
in the hippocampus, pons or cerebellum and P2Yi 2 was not 
detected in the hippocampus, cerebellum or in peripheral 
liver tissue. GPR95 transcripts were observed in both 
human and rat peripheral tissue. A 1.8 kb signal was 
detected in human prostate, rat stomach and rat fetal tissues 
(Fig. 2E,F). However, GPR95 expression was not observed 
in human thalamus, hypothalamus, hippocampus, pons or 
cerebellum or in rat whole brain tissue. In addition, we 
performed Northern analyses of P2Y i2 transcripts in 
human CNS tissues, which revealed faint signals of 2.7 kb 
in the frontal cortex, caudate putamen, thalamus, hypotha- 
lamus and pons (Fig. 2G). GPR101 mRNA transcripts of 9.5 
and 4.2 kb were detected in the caudate putamen and 
hypothalamus, with no expression detected in the frontal 
cortex, thalamus, hippocampus and pons (Fig. 2H). 
GPR1G3 transcripts of 4.0, 2.6 and 1.4 kb were detected 
in the thalamus and hypothalamus, with a further 9.5 kb 
signal in the hypothalamus and a 1 .4 kb signal in the pitui- 
tary (Fig. 21). In addition, GPR103 transcripts were also 
observed in the frontal and occipital cortices, basal fore- 
brain, midbrain and pons (data not shown). 



4. Discussion 

Currently, —350 human GPCRs have been cloned, as 
listed on the GPCRDB (G protein-coupled receptor data- 
base, http://www.gpcr.org/7tm/), with —250 representing 
family A (or rhodopsin-like) GPCRs (Lee et al., 2001a). 
These receptors total approximately half the predicted 616 
GPCR-encoding sequences contained in the human genome 
(Venter et al., 2001), although the veracity of this total 
number remains to be confirmed. 

The identification of genes encoding the novel GPCRs 
-predicts the existence of-novel-signaling systems-leading to — 
the discovery of novel ligands, as demonstrated by recent 
reports describing the discovery of apelin (Tatemoto et al., 

1998) , prolactin-releasing peptide (Hinuma et al., 1998), 
orexin (Sakurai et al., 1998), melanin-concentrating 
hormone (Bachner et al., 1999; Chambers et al., 1999; 
Lembo et al., 1999; Saito et al., 1999; Shimomura et al., 

1999) and urotensin II (Ames et al., 1999; Liu et al., 1999; 
Mori et al., 1999; Nothacker et al., 1999) receptor-ligand 
systems. As these GPCR genes (many of which were cloned 
in our laboratory) were used in methods that led to the discov- 
ery and identification of these ligands (Lee et al., 2001a), we 
are continuing to isolate- and characterize these novel genes. 

We now report the discovery of ten novel oGPCRs and a 
pseudogene. GPR78 is a paralogue of GPR26, apparent 
from shared identities (56% in the TM regions), a lack of 
asparagine-linked extracellular glycosylation sites, a short 
amino terminus, and similar gene structure (Fig. 1). GPR78 
and GPR26 encoded receptors with shared conserved catio- 
nic arginine and lysine residues in TM6 and TM7, respec- 
tively, two residues recognized to play a role in purinergic 



binding and found only in P2Y receptors (Erb et al., 1995; 
Jiang et al., 1997) (Fig. 1). However, we reported calcium 
mobilization assays of human astrocytoma 1321N1 cells 
and Xenopus laevis oocytes transfected with GPR26 were 
not responsive to nucleoside di- and tri-phosphates (Lee et 
al., 2000). We reported high levels of GPR26 expression in 
many brain regions (Lee et al., 2000), while GPR78 was 
detected only in the pituitary and placenta. However, the 
overall structural homology suggested that GPR78 and 
GPR26 may encode receptors that share a common endo- 
genous, ligand. 

GPR80, GPR8], GPR82, GPR93 and ^GPR79 all shared 
identities to P2Y GPCRs or P2Y-like oGPCRs. Previously, at 
least three different nucleotide receptor phenotypes have 
been observed in mammalian tissue, including GPCRs acti- 
vated by adenine nucleotides (e.g. P2Yj and P2Y M ), uridine 
nucleotides (e.g. P2Y 6 ) and by both adenine and Liridine (e.g. 
P2Y 2 and P2Y 4 ) (King et al., 1998). tyGPR79 shared closest 
identity with P2Y 2 and P2Y 4 , even though it does not encode a 
functional GPCR. GPR80 was observed to share highest 
identity with P2Y b while GPR81 , GPR82 and GPR93 shared 
identities with the P2Y-like oGPCR genes HM74, GPR34 
and P2Y 5 , respectively. Some of these oGPCR genes encode 
aa residues conserved amongst the P2 Y receptors and shown 
to be involved in purinergic ligand binding (Erb et al., 1995; 
Hoffmann et al., 1999; Jiang et al., 1997) (Fig. 1). While 
expression was not observed for GPR80, GPR82 and 
GPR93 in various CNS regions, GPR81 was observed to 
have an mRNA transcript in the pituitary, suggesting a role 
in neuroendocrine regulation. 

The identities of GPR94 and GPR95 with the genes 
encoding the UDP-glucose and P2Yj 2 receptors (>50% in 
the TM regions) indicate a novel subfamily of purinergic- 
like receptors. Previously, the UDP-glucose receptor was 
reported to have a distant sequence homology with the 
— P2^receptors,-with an observed widespread tissue distribu-- 
tion in human brain and such peripheral tissues as placenta, 
adipose tissue, spleen, intestine, stomach, skeletal muscle, 
lung and heart (Chambers et al., 2000). The identification of 
the platelet ADP (P2Yj 2 ) receptor resulted from cDNA 
isolated from rat platelets and human hypothalamus 
(Zhang et al., 2001; Hollopeter et al., 2001). An alignment 
of these receptors with the GPCRs encoded by GPR94 and 
GPR95 revealed several residues conserved in P2Y purino- 
ceptors (see above) also conserved throughout this novel 
purinoceptor-like subfamily (Fig. 1). Northern analysis of 
GPR94 mRNA revealed expression in various regions of the 
brain, suggesting a neuromodulatory role. In contrast, 
GPR95 mRNA was detected in peripheral tissue (i.e. 
human prostate and rat stomach). 

The receptor encoded by GPRJ0J appeared to be a distant 
relative of the biogenic amine superfamily of GPCRs, with 
TM identities of —30% with the adrenergic and serotonin 
receptors, as well as the muscarinic and dopamine receptors 
(data not shown). GPR101 mRNA transcripts were observed 
in brain tissue, suggesting the presence of an endogenous 
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amine neurotransmitter ligand, perhaps novel in identity. 
The receptor encoded by GPR102 shared significant TM 
identities with an amine binding receptor-like GPCR family 
(including PNR, GPR57 and GPR58) suggesting they may 
also share a common endogenous ligand. The receptor 
encoded by GPR103 shared highest identities with several 
neuropeptide receptors. The significant levels of GPR103 
expression in the brain, particularly in the thalamus and 
hypothalamus, suggest an endogenous peptide ligand, 
perhaps involved in physiological functions such as pain 
modulation and neuroendocrine regulation. 

Interestingly, some of these novel oGPCR genes appear 
to be clustered on various human chromosomes. The GPR81 
gene was localized to chromosome 12q, proximal to the 
closely related oGPCR gene, HM74. The GPR82 gene 
was retrieved together with the GPR34 gene in a human 
BAG clone localized on chromosome i. Another search of 
the HTGS database revealed GPR94, GPR95 and P2Y J2 
localized within the same BAC clone on chromosome 3. 
This cluster of genes also includes the UDP-glucose recep- 
tor gene, which together with the more distant P2Yj receptor 
gene further localizes this cluster to chromosome 3q24-25 
(interval D3S 1279- 1280) (Hollopeter et al., 2001). We have 
previously reported clusters of homologous GPCRs genes, 
including the GPR40 through GPR43 gene cluster (Sawz- 
dargo et al., 1997) as well as the 5-HT 4 -like pseudogene, 
ij/GPR57, GPR58 and PNR gene cluster (Lee et al., 2000). 
Given the significant sequence similarity of GPRI02 with 
PNR, GPR57 and GPR58 and its localization on chromo- 
some 6, GPR102 may be another paralogue member of this 
gene cluster. 

In conclusion, we have identified ten novel GPCR genes 
and a pseudogene. Transcripts for GPR78, GPR8J, GPR94, 
GPR95, GPRJ0J, GPRI03 and P2Y n were detected in 
various CNS and peripheral tissues. Given the high levels 
~ofidentity observed - within-paralogous-oGPCR' gene-clus- - 
ters, future efforts will likely discover common endogenous 
ligands for each of these novel GPCR subfamilies. The 
increasing number of oGPCRs that continue to be isolated 
with unique distribution profiles in brain and periphery is 
indicative that the search for novel transmitter ligands 
should be intensified. These efforts have the tremendous 
potential to uncover novel physiological roles for these as 
yet unknown receptor-transmitter signaling systems. 
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